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Abstract. We introduce formal languages over infinite alphabets where words 
may contain binders. We define the notions of nominal language, nominal monoid, 
and nominal regular expressions. Moreover, we extend history-dependent au- 
tomata (HD-automata) by adding stack, and study the recognisabUity of nominal 
languages. 

1 Introduction 

Automata over infinite alphabets have been receiving an increasing amount of attention, 
see eg [11, 15, 1,17]. In these approaches, the countably infinite alphabet fA^ can be con- 
sidered as a set of 'names', which can be tested only for equality. Typically, languages 
of interest such as 

Li = {ni...nk e 9\C* I 3/ ^ j.ni = nj} (1) 

from 1 11 J are invariant under name-permutations: If eg nmn is in the language, then so 
is n'mn' = {nn!)- nmn, where {nn')- nmn stands for the application of the transposition 
(n n') to the word nmn. This suggests to think of the names as being bound and lan- 
guages to be closed under a-equivalence. On the other hand, we may fix a name n\ and 
consider the language 

= {nin2...nk G \ j .ni^nj) (2) 

from [17]; we can think of n\ as a free name and of the ni, • • -nt as bound. This suggests 
to study not only words over names, but also words which contain binders and allow 
us to make explicit the distinction between bound and free names. Automata on words 
with binders already appear in [16] in the study of the ^-calculus. In this paper we begin 
the systematic study of words with binders from the point of view of the classical theory 
of formal languages and automata. 

In particular, our contributions are: 

- nominal languages of words with binders (§ 2) as a natural generalisation of formal 
languages over infinite alphabets; 

- nominal monoids (§ 3) as the corresponding algebraic structures; 

- nominal regular expressions (§ 4) as a generaUsation of regular expressions; 
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- HD-automata with stack (HDS) (§ 5) and Theorem 2 showing that nominal regular 
expressions can be faithfully encoded into HDS. 

One of the motivations to study words with binders comes from verification. For in- 
stance, consider the Needham-Schroeder protocol 

A ->B : {n,A}B 
B ^A: {n,m}A 
B : {m}B 

The (correct) runs of the protocol can be characterised by a nominal regular expression 

{n. ENCR n A FOR B {m. EMCR n m FOR A (EMCR m FOR B) ) ) * (3) 

where the alphabet is now 9{_US with n,m&9{^ and {ENCR, FOR, A , B} = S & finite set 
of 'letters'; finally, (n.e) binds all the free occurrences of n in e and generates afresh 
name n. From (3) one could obtain an HDS for monitoring the execution of a protocol, 
i.e. the HDS would be able to detect if something goes wrong during the execution 
(e.g., an intruder is performing an attack). From an automata theoretic point of view, 
the interesting new feature appears more clearly if we abstract (3) to 

{n.n{m.nm))* (4) 

and note that binding (fresh name generation) (_._) appears under the Kleene star, which 
is the reason why automata accepting such languages need to have a stack. 

2 Nominal Languages 

We introduce languages with name binders. This section appeals to our intuitive under- 
standing of binding and a-equivalence as known from eg ^.-calculus or first-order logic, 
but see the next section for a formal treatment. To start with, the alphabet is divided 
disjointly into a countably infinite set fhC (of names) and a finite set S (of letters). 

Definition 1 (m-word). An m-word is a term built from constants f^V^U^ U {e}, and 

two binary operations o, {(_._», according to 

def I , I , 

w = t\n\s\ wow I ((n.w», 

where n ranges over and s over S. We denote by M the set of all m-words. 

As in the classical case we assume that e (the empty word) is the neutral element wrt 
o and that o is associative. We often write wv for the concatenation vi- o v. Furthermore, 
we let {{n.Wi bind the free occurrences of n in w and take m-words up to a-equivalence. 

The notion of m-word is the most general notion of word with binders: We only 
require from words to form a monoid and behave well wrt a-equivalence. Due to the 
scope introduced by binding, words now have a tree structure. This motivates the fol- 
lowing, more special, but perhaps more naturally generalised, notion of words. 
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Definition 2 (g-word). A g-word is a term built from e, unary operations S-for each 
n G IAC,s G S, and a binary operation «_._», according to 

def I I I 

w = e I nw I sw \ {«.w». 
We denote by G the set of all g-words. 

Regarding binding and a-equivalence, we follow the same conventions as for m- words. 
To consider G as a monoid, we define o : G x G G as follows: 

def def / 

eow = w nwov = n(wov) 

def , . def , , , . 

SWOV = S{W0V) m.WiOV = ((M.(wovj)) 

where n' is fresh for v and «n'.w'» is an a-renaming of m-W). Intuitively speaking, we 
extrude the scope of the binding to the end of the word. 

Next we allow binders to appear only at the beginning of a word. 

Definition 3 (1-word). An 1-word is a pair {p,w) where p G 9C and w G (fA^U5)*. We 
denote by L the set of all l-words. 

We interpret p as a prefix of name binders and w as the part of the word that has no 
binders, o ; L x L L is given on the left below 

def def 

{p,w)o{q,v) = {pq,wv) [n\{p,w) = {np,w) (6) 

where we assume that p and q, p and v, and q and w have no names in common. 
Whereas previously name-binding was built into the syntax via (_._ , we now define 
explicitly, anticipating notation from § 3, a binding operation [_]_; £^/^ x L ^ L via the 
clause on the right of (6). 

Definition 4 (s-word). An s-word is a pair {S, w) where w G (fA^U5)* and S is a subset 
of the names appearing in w. We denote by S the set of all s-words. 

On S, we define the two operations o and [_]_ as follows, assuming that S and T, S and 
V, T and w have no names in common. 

^ ^ def ,„ ^ ri.„ ^def j {S U {n} ,W) If « lU W 

{S,w)o{T,v) = {SUT,wv) [n\{S,w) = i . (7) 

1(5, w) otherwise 

sl Ifi gm 

Remark 1. We have embeddings S ^ L G ^ M. For sl we assume that names are 
ordered; the other main clauses are lg{np,w) = {{n.lg{p,w)} and gm(nw) = nogm(w). 



3 Nominal monoids 



The somewhat informal treatment of § 2 should be sufficient to understand how au- 
tomata process words with binders in § 5 and § 6. On the other hand, from a conceptual 
point of view, it is important to have a unifying account. The presence of names and 
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binders suggests to employ nominal sets [8]. This not only provides us with a math- 
ematical theory, but also a clear conceptual guidance: Follow the classical universal 
algebraic account of languages and automata, but replace sets by nominal sets. Here, 
we apply this to languages and monoids. 

Nominal sets and their logics come in different versions. We follow [9], for which 
we need to refer to for details. More details can also be found in [12]. Let us just recall 

Definition 5 (Nominal set). Denote by Perm{9\C) the group of permutations of 9\C gen- 
erated from the set of transpositions {{n m) \ n,m & fAQ- A set A equipped with a 
Perm{9Q-action Perm{9{) x A — >■ A is a nominal set, if every element in A is finitely 
supported. This means that for each a € A there is finite set S C (called a support 
of a) such that n\s = id n- a = a for all n € Perm{9i£) (where %\s denotes the 
restriction ofn to S). Maps between nominal sets are required to be equivariant, that is, 
they respect the permutation action. 

It follows that each element a G A has a minimal support supp{a) and one writes n#a 
(n is fresh for a) for n ^ supp{a). This allows us to define abstraction [8, Lemma 5.1] 

def def 

as [n]a = {{n,a)}VJ {{m,{nm) ■ a) \m#a} dind[9{]A = {[n]a\n & 9{^,a & A}. 

A nominal algebra 21, see [9, Def 4.13], consists of a nominal set A, constants 
« G iV^, and a map [9(\A ^ A. As in universal algebra, further operations and equations 
may be added: 

Definition 6. A nominal monoid is a nominal algebra 21 with additional constants s&S 
and (equivariant) operations e, o so that (A, e, o) is a monoid. 

We say that w e A is closed, or that w contains no free names, if supp{w) is empty. 

Definition 7. Write Cm for the class of all nominal monoids. We consider the following 
axioms where m,n€:9(j s&S, andX, Y are variables ranging over carriers of algebras. 



Cg, Ci, Cs are axiomatised by Axl, Axl-3, Axl-5, respectively. 

Remark 2. One possible reading of the operations and the axioms is as follows. In M, 
we have sequential composition o, allocation «« of a resource named n, and deallocation 
)). In G, we don't care about deallocation (garbage collection). In L, the timing of the 
allocation does not matter and all resources may be allocated at the start. In S, the order 
of allocation does not matter and the allocation of an unused resource is redundant. 

But other interpretations are possible. With [n\ as the V« of the 7t-calculus and o as 
|, Ax6 becomes the familiar law of scope extrusion. Interpreting [n] as V, Ax4-5 are 
famiUar laws of the universal quantifier. In [14], a binder satisfying Ax4-5 is called a 
name-restriction operator. 

We can now simmiarise the previous section conveniently in Table 1 and 
Theorem 1. M,G,L,S are the initial monoids in, respectively. Cm, Cg, Ci and Cs- 




Ax2 h so[m]Y = [m]{soY) 




Ax6 n#X h Xo[n]Y = [n] (XoY) 
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Table 1. Summary of nominal monoids and the axioms 



Classes 


Axioms 


Initial monoid 


Typical example 


Cm 




M 


[ni]{sinin4)[rH)]{no[ni]s2) 


Q 


Axl 


G 


[ni]{sinin4[no]{n()[n2]s2)) 


Ci 


Ax 1-3 


L 


[nil[nol[n3l-Vl«in4"O.V2 


Cs 


Axl-5 


S 


[no][ni]sinin4noS2 



Proof. The detailed proof can be found in [12]. □ 

Remarks. We have a mapping /m : 2^^ — !■ 2'^'^^-^'* to plain words (ie words without 
binders) determined by fM{{in.Wi}) — /m({w}) U {(n m) • v | v e /m({w}),ot#v}. With 
the embedding gm o Ig o si from Remark 1 this induces a map /§ from languages of 
s-words to subsets of (lA^UJ)*, eg /s({({n},n)}) = /]«({((«•«))}) = iK- 

4 Nominal Regular Expressions 

In analogy to the classical definition, we introduce nominal regular expressions: 

e ■.■.= 1 \ 0\ n \ s \ e + e \ eoe | {n.e) \ e* (8) 
where nG9{_ and s gS. The semantic interpretation L is defined as follows. 

1. L(l) {e}, L(0) 0, Lin) ^ {«}, L{s) {s}, 

2. L{ex+e2) = L(ei) UL(e2), 

def def 

3. L{e\oe2) = L{e\)oL{e2) = {wiow2\wi £L{ei),W2&L{e2)}, 

4. L{{n.e)) = [n]L{e) = {[n]w \ w e L{e)} . 

5. L{e*) = ML(e)', where L(e)' = L(e) o • • • oL(e), 

itimes 

Remark 4. The definitions of o and [_]_ are dependent on the choice of row in Table 1, 
compare (5), (6), (7). For example, on M we have \n]L{e) = {in.wj, \ w G L(e)} and on 
L we have [n]Z'(e) = {{np,w) \ {p,w) e L[e)}. From § 5 onwards, we will interpret 
regular expressions in M only. 

Example 1. We have seen in (4) how {n.n{m.nm))* arises from the Needham-Schroeder 
protocol. In § 6 we consider the simpler expression m{{n.mn))* which intuitively rep- 
resent the computations of a security protocol where (an unbound number of) new 
'nonces' n are generated within a session m and should always be paired up with m. o 



We can also interpret nominal regular expressions in plain words. For example, let 
L take values in S and let /s : 2^ ^ 2(^^'^)' denote the map of Remark 3. 
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Example 2. If we interpret {n.n)* in S (or L or G) we obtain the language 



^ = fs{L{{n.n)*)) = {ni---nk\yi^j.ni^nj}, 



which is the complement of Li from ( 1 ). £2 is not recognised by the FMAs of [ 11 ] but it 
is recognised by the FRAs of [17]. The latter notes that £2 * = {wv | w,v e L2} shows 
that languages recognised by FRAs are not closed under composition. On the other 
hand, the presence of binders allows us to use o (respecting the 'hidden' binders) instead 

of * and we obtain L20L2 = fs{L{{n.n)*)) o fs{L{{n.n)*)) = fs{L{{n.n)* o (n.n)*)) = 
fs{L{{n.n)*)) = Li, where the second equality is our definition of 'nominal concate- 
nation' on languages of plain words. This indicates that even for languages without 
binders the composition with binders is a natural concept. 

Similarly, if we interpret e = {I .lo{m.{n.mon))*) in M we obtain another example 
of Tzevelekos: 

fM{L{e)] = {mn\n\n\nl---ninl | Vj € N,V7 e {1,2}. m 7^ n/&n,? ^n}} 
5 History-dependent Automata with Stack 

We build our nominal automata theory on HDA (after history-dependent automata) [13]. 
HDA are a computational model of nominal calculi defined on the notion of named sets 
and extend classical automata with finite sets of names local to states and transitions. 
We equip HDA with stack; this renders them suitable for recognising nominal lan- 
guages interpreted in M. We argue that HDA are natural candidates to build a theory 
of automata of nominal languages with binders. In fact, they are equipped with mecha- 
nisms to capture name restriction of nominal calculi [3, 5, 4] and formally linked to the 
nominal set theory in [10, 6]. 

Let ★ ^ be a distinguished name; a stack E is a sequence of finite partial maps 
a : — >^ iV^U {★} (we use _L to denote the empty map). The empty stack is denoted by 
S, a stack with head a is written a :: Z, and 



respectively are the pop, pop twice, and top operations. 

Definitions ([13]). A (basic) named set (2, is a set Q (of states) with a map 
\-\q '■ Q ^ 'I'a 'Ni sending q Cz Q to a finite set of names \q\Q (called local names of ^j. 

Basically, the elements qofa named set are equipped with a set of local names \q\Q. 
Hereafter we omit subscripts when clear from the context and write a named set (2, | _| g) 
as Q, in which case |_| is understood as the map of local names of Q\ also, the update 
of a map f:X^Yatx with y is the map 




f[xi-^y]:XU{x} ^YU{y} such that {f[xi-^y]){a) = 
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Before giving the formal definition, we intuitively present HDA with stack. A tran- 
sition {q',a,a) from a state q consists of the target state q', a label a, and a map a 
keeping track of the correspondences of names. Labels a can be a local name n € |^| of 
the source state q, letters s € J, or any of the distinguished symbols 



respectively representing internal transitions, push, pop, name allocation, and name 
deallocation. Example 3 gives a convenient graphical representation of an HDS. 

Example 3. Let qo, q, and q' be states with |qo| = {x}, |q| = {z}, and q' = 0. The HDS 



has initial (resp. final) state qo (resp. q'). Both qo and q have a transition exposing their 
(unique) local name (x and z respectively). Maps among local names are represented by 
dashed arrows. Also, q has ~r\ transition to q' with the empty map of local names. o 

Definition 9. A (non-deterministic) history-dependent automaton with stack on fA^U5 
(HDS) is a tuple {Q,qo,'r\,F,tr) where 

- Qis a named set of states (the states of the automaton); 

- qo&Q is the initial state; 

- r[is a partial function from Iq'oIq to 9{^; 

- F CQis the named set of final states with \-\f being the restriction of\-\QtoF; 

- tr is the transition function returning for each q€ Qa finite set tr(q) of transitions, 
namely tuples {q',a,o) such that 

• if a e !AC then a e \q\Q 

• if a = I then O : \q'\ \q\ U{*} 

• if a =r\ then a : |^'| -> !J\C 

• otherwise a : \q'\ — >■ \q\ 

and, in either case, a is a partial injective map (see Remark 6 on page 11). 

Transitions in Def 9 allow HDS to accept names or letters or to manipulate the stack. 
Besides the usual push {r\) and pop (-r^) operations, HDS feature allocation («) and 
deallocation (») of names. 



Example 4. Let Q = {qo,q,q'}, F = {q'}. The HDS H = {Q,qQ,'!\,F,tr) where 



e 



rv 




qo H> {x} 
J q ^ {z} 
q'^0 



qoH>{(q,x,a:z->-x)} 
tr: ■|qh^-{(q',^,_L),(q,z,ai:z^-z)} 
q'^0 



formally defines the HDS in Example 3 (where r] is not represented for simplicity). <> 
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We now define how HDS can recognise languages of M. Hereafter, we fix an HDS 

^ = {QAQ,^,F,tr) (9) 
and, for any stack Z and any name mapping a, we define Z • a by 

Q»a = a::8 and E-a = (eT)[* *] oa :: (Z) 

that basically updates E by post-composing its top map (if any) with a. Note that 
this requires E^ to be updated to allow composition when ★ g cod{a). 

A configuration of 9{ in (9) is a triple {q, >v,E) where <7 G 2, w is an M, and E is a 
stack. Call initial a configuration ((j'o, w,r| :: 8) and accepting {q,E,T.) if q E F. 

Definition 10. Given q,q' (z Q and two configurations t = {q, w, E) and t' = {q' , vv' , E'), 

!H in (9) moves from / to t' (written t — )■ t') iff there is {q',a, a) e tr{q) such that either 
of the following cases applies 

ae|^| w = nw' A ET(a) =n A E' = E»a 

a = s&S w = iw' A E' = E»a 

a = e w' = w A E' = E»a 

a=r\ w' =M' A E' = o :: E 

a =vr^ =^ w' = w A E' = o' :: E where a' ~ T, oa 
a = i => w = in.w' A E' = o' :: E, where a' = H> n]) oa 

a = )) w = A E' = a' :: E , w/iere a' = E o a 

77je ^e? recj{{t) of states reached by Hfrom t onw is defined as 



def 



= 



{q} ift = {q,e,'L) 

U M , r'ecy{{t') otherwise 



A run of i?/^ on an m-word w is a sequence of moves of ^ from {qQ,w,\\ :: 8). 

Intuitively, HDS "consume" the word in input moving from one configuration to another 
(likewise classical automata). However, when the current word starts with a name n, the 
automaton can progress only if the name "is known"; namely, it is necessary to find a 
transition from the current state q for which the stack maps a local name of q to n. 

HDS use a stack (/) to keep track of the names of the current state and, noticeably, 
(ii) to (de)allocate bound names in input strings. More precisely, a binder is consumed 
using a i transition which updates the meaning of the names. This is basically done 
by post-composing the mapping o in the selected transition with the map on the top 
of the stack (opportunely updated to take into account the allocation of n). Instead, a » 
transition will pop the stack so reassigning previous meanings to names in the current 
state by post-composing the map o of the transition with "the second one" in the stack. 

An automaton IhC recognises w if it has a run from its initial state to a final state that 
consumes w. 
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Definition 11. The HDS H in (9) accepts (or recognisesj w if F r\rec<^{{qo,w,r\ :: 
8)) ^ 0. The language of (written L^i) is the set of words accepted by !H. 

Examples. If H is the HDS in Example 4 and r] : x i-)- n, then Xh = {n' | j > 0}. o 

Defs 10 and 11 contain some subtleties worth spelling out. First, observe that the 

language recognised by 9f depends on r| which intuitively sets the meaning of the local 
names of the initial state ^o; instead, the language of !>{ does not depend on the identities 
of the local names of the states in !H. Secondly, an alternative definition would allow 
the initial stack to be empty and the correspondence between local names of the states 
of !}{ and those in the input word is incrementally built during recognition. This class 
of HDSs would be equivalent to the one in Defs 9 and 10, but it would have made our 
constructions more complex. Finally, as for classical push-down automata, we could 
have equivalently required that an HDS recognises an m-word w only when it has a run 
leading to a final state that consumes w and empties the stack. We opted for Def 11 as it 
is conceptually simpler. For instance, the following lemma (used to prove Proposition 3) 
states that only the top of the stack is relevant for accepting words. 

Lemma 1. Any configuration reachable by an HDS as in (9) from {qo,w,r\ :: Q) is also 
reachable from {qQ,w,'X\ : : Z) for any stack Z. □ 

In § 6 we show how a nominal regular expression e can be mapped on an HDS (\e\) 
that recognises the language of e. Theorem 2 is the main result 

Theorem 2. For each nominal regular expression e, //(|(.d = L(e) interpreted on M. 

Proof. The proof is by induction on the structure of e. The base cases are trivial while 
the other cases follow by Propositions 1, 2, 3, and 4. □ 

6 HDS and Nominal Regular Expressions 

We use nominal regular expressions (8) to establish a correspondence between HDS 
and nominal formal languages. More precisely, we give (Def 12) the map mentioned in 
Theorem 2 as the homomorphic image of nominal regular expression on an algebra of 
HDS given in the rest of this section. 

Definition 12. The map (|_D from nominal regular expressions to HDS is defined as: 



dlD = {{qo,q},qo,-^,{^}^'io^ {{q,^,-^)}) where \qo\ = \q\=V) 
W = ({9o},^o,-L,0,?oH-0) where \qo\=® 

H = {Uo,q},qo,x>-^ n,{q},qo>-^ {{q,x,±)}) where \qo\ = {x},\q\ =0 
W = {{qQ,q},qQ,-L,{q},qo'-^ {{q,s,±)}) where \qo\ = 1^1 =0 

I\ei+e2\) = ^eiHW 
deioezD = deiDodeiD 



where the operations on HDS in the last four cases are defined in the following. 



<\{n.e)\l 
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The operations on HDS in Def 12 allow to combine them so that the language of the 
resulting HDS has a clear relation with those the operations act upon as per Proposi- 
tions 1, 2, 3, and 4 below. Theorem 2 can be proved by induction on the structure of 
nominal regular expressions using such propositions. 

Remarks. The map (|_D in Def 12 depends on the choice of local names; however, as 
noted in § 5, recognisability does not depend on the identity of such names. 

The first two clauses in Def 6 do not involve names and stack. Notably, the third 

clause states that the HDS corresponding to an expression n has simply a transition 
from the initial to accepting state and in the initial configuration the unique name of the 
former is mapped to n. 

The set of (local) names of an HDS !H as in (9) is defined as = (J^eg ki- 
ln the following, we fix two HDS 

^ = {Qi,qo,i,'(\i,Fi,tri) for i e {1,2} (10) 

and, without loss of generality, we assume that 2i fl ^2 = and | PI = 0. 

Definition 13. Let ^ Ci U Qi be a new state. We define y{i+y{2 to be the automaton 
9{+ = {Q+,q^,y\+,F+,tr+) where 

- g+ = giUg2U{gJ} where l^'J Ig+ = ko.ilgi U |^o,2|e2 and F+ = F^yj F2 

- f'"^(?o ) = {(^o.i-ei^^o.,!) I forie{\,2}} and tr+\Q.=tri fori 2], where 
id\qf,i\ is the identity from l^o./lg; to |i2+ 

- T*" = Til +112, namely r[+{x) = r[i{x) ifx e l^cda- 

Proposition 1. Lj{+ = Lj^^ U Lj{^ 

Proof. The statement trivially follows from Def 10 as (i) qo has only two outgoing e- 
transitions which lead to the initial states of either of or and (ii) r] preserves the 
name assignments rji and r[2- □ 

Lemma 2. For each HDS there is an HDS !H' with a unique final states and such 
that = Lj^i. 

def 

Proof Given i?/ in (9) and such that 1^1 =0, we define i?/' = (gU{^},^(),ri,{^},fr') 
where tr^^q) = 0, tr' = tr when restricted Xo Q\F, and tr'[q) = tr{q) U {(^e,^)_L} for 
each q gF. The proof that Ly{ = L^, is similar to the proof of Proposition 1. □ 

Lemma 2 allows, without loss of generality, !}{ in (9) and each of and in (10) to 
have a single final state, namely F = {q/}, Fi = and F2 = {qf,2}, respectively. 

The following construction extends the names of an HDS without altering its lan- 
guage and is used in Def 15. 

Definition 14. Given fH as in (9) and x G 9{^\ \!H\, = {Q' ,ql,y\\F\tr^') is the 
HDS such that 

- is the named set having the same elements ofQ with j-lgt -.q^ \q\q^ {A 
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- F' is the named set with the same states ofF and |_|ft : q i— )■ \q\Qt U {x} 

- tr''{q) = {{q' ,a,a[xi-^ x]) \ {q' ,a,a) G tr{q)} 

- T]^ : l^lg U {x} 9\C is the partial map undefined on x and behaving as t] otherwise. 

Hereafter, we assume that x ^ 9{W9{\ when writing 9{^x; in fact, by the locality of the 
names in the states of an HDS, if ^ is a state of !>{ such that x S \q\, we can replace x 
with any name not in |^| by rearranging all the maps in the transitions reaching q. 

Lemma 3. L,j{^j^ = Lj^. 

Proof. The proof that Ly{ C Lj^^^^ is trivial as all the transitions of H have a corre- 
spondent in "H^x with exactly the same labels and name mappings. The converse also 
hold trivially as x cannot play any role in the recognition of a word in H'sx as rj' is not 
defined on x. □ 

Definition 15. Let{x\,. . . = |^o,2| (• • • (^it^i) • • • t).^; = (2'i,?o,i,il',{?/,i}>f ')■ 
TheHDSHxoH2 is defined as {Q" ,4q^,'(\° ,Fi,tr°) where Q° =Q[[JQ2and 




xe 1^0.2 1 

otherwise 



tr°{q) 



t/iq) 
tri{q), 

tr'{q)yj{{qo:i,Z,id\qo,2\)}^ 



1 = 



The HDS ^{[O^iih built by connecting the accepting state of 9{\ to go,2, the initial state 

of !}{2, after adding |<7o.2| to 9-(\. Note that the newly introduced e-transition maintains 
the initial meaning of the names in \q(}2\ since r|° acts as r|' on 1(70,21 (and by Def 14). 

Remark 6. A definition more complex than Def 15 can be given to preserve the injec- 
tivity of the initial mapping r[° when rji and TI2 are injective. This requires to relax 
the injectivity condition on a in Def 9 requiring o(x) ~ a(y) <;=^ o(x) = ★ for any 
x,y € domip). We opted for the simpler Def 15 as it just allows more non-determism 
without altering the expressiveness of HDS. 

Proposition 2. = Lh^ o Lh^. 

Proof. The automaton iHiolHi reaches a final state iff w = wiWi where w, e L^^ for 
i = 1,2. In fact, to reach qj^ it is necessary to reach qjw first and the unique transition 
from qf^i to ^0,2 maintains on the stack the meaning assigned to the names |^o,2| as per 
the stack. □ 



Definition 16. 

is such that 



Ut H he as in (9) with F = {qf}. The HDS H* = {Q,qo,y\,{qf},tr*} 

tr*{q) = tr{q), for all q e Q\{qo,qf} 
tr*{qo) = tr{qo){J{{qf,e,±)} 
tr*{qf) = {(^o,^,Tl)} 

The construction of 9-(* simply adds an e-transition from q^ (the initial state of Of) to 
qf (the accepting state of OC) and a r>-transition from qf to ^0 that re-establish the 
mapping of the initial configuration preserving in the stack the meaning of the names. 
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Proposition 3. Lj{, = 

Proof. (Sketch.) First, observe that trivially e S L<^, n X-^ because !>t* has a transition 
(^,e,_L) from go- 

We now prove that L.x> Q L*^. If w 7^ e G then 9-[* reaches a configuration 
{q, e, E) for a suitable E. By construction and Def 10, !>{* can visit q only a finite number 
of times k. Hence, w = wi o • • • o where w, is the word processed between the j'-th 
visit of q and the previous visit of q (or of qo if ; = 1). 

Observing that each visit of q is preceded by a visit of qf (since q can only be 
reached trough qf), we have that w\ G Lj^ (and hence in X^) because there qf can be 
reached from the configuration ((j'OjVVi,!! :: Q). For the same reason, we can conclude 
that w,+i e for each ; e {1, 1}; in fact, the i-th visit of q yields i?/* in the 
configuration (^,w,+i o . . . o Wjt,E) for some stack Z. Hence, using the unique transition 
((^o.^jTl) from^, the automaton "resets" to the configuration (<7o, Wi+i o • • -ow*,!! "E), 
which basically amounts to say that w; can be recognised by H and the next work w,+i 
is processed from a configuration where r| is on the top of the stack and the thesis 
follows by Lemma 1 . 

We prove that L*^^ C L^, . Any word w G has the form w = wi o • • • o where 
w, e -L^ for each i e {1, . . . , A:}, so we proceed by induction on A:. If A: = the the- 
sis follows trivially. If A > then, from the configuration ((^OjWi o W2 o . . . o vi'i ,r| :: 8), 
H* reaches a configuration (g'/,w2 o . . . o Wi:,E) since wi G Lj{ by hypothesis. Since 
(^,e,_L) G tr'{qf), the configuration (§,W2 0. . .ow^,Z»_L) is reachable from i?/"* . There- 
fore, 9{* reaches the configuration {qo,W2 o . . . o wt,il :: Z» _L) which yields the thesis 
by Lemma 1 . □ 

Definition 17. Letn Ef]\C,y{be as in (9) withF = {qf}, and let q,qf ^ Qbe new states 
with \q\ = |go|\il"'(«) ("^d \qf \ =0. The HDS [n]^ ^ (2U{^,^/},^,ri||^|,{^/},/r') is 
such that 

tr'{q) = {{qo,i,o)}, tr' {q) = tr{q) , yq€Q\{qf}, tr'iqf) =tr{qf)U{{qf,),±)} 
where a = id|^„| [x i->- i/rj"' (n) = {x}, otherwise a = id|^„|. 
Proposition 4. X[„]j/ = [n]LH. 

Proof. By construction, (^,w,ri||^| :: 8) reaches qf iff there is a word w' such that 
w = m.w' and {qQ,w' ,0') reaches reaches qf where a' is built as in Def 10. Again 
by construction, this is possible iff (t/o. vi/.o') visits qf and the last transition which 
consumes the word is a (deallocation) j-transition from qf to qf. This is equivalent to 
say that there is w" G such that w' = w"} which, by Remark4, yields the thesis. □ 

7 Mapping Nominal Regular Expressions to HDS 

We build the HDS 

m((n.mn))*D Corresponding to the expression m((«.mn) )* by applying 
the constructions of § 6. By Definition 12, the HDS corresponding to the expression m 
is -^^mH with 

W = ^m\i = {{^0,m,^f,m},^0,m,^m,{'^f,m},trm) (H) 
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where |qo,m| ={x}, |q/,m| =0,ilm: xH>m,fr„: q/,„ i-^ 0, and fr„ : qo,m i-^ {(q/,m,x,-L)}. 
Analogously, the HDS corresponding to the expression n is !Hi^„^ with 

W = ^n]) = ({qO,n,q/,n},qO,n,11«,{qO,«},^'-n) 

where |qo,„| = {y}, |q/,„| = 0, ri„ : y i-^ n, tr„: q/,„ h^0, andfr„: qo,„ ^ {(q/,„±,y, }). 

To compose i?/JmD and i^j„D, we first have to compute !H^„^'fy; by Def 14, ^m])'tv = 
(Qt,qo,ti'nt'{q/,t}>*''t) where Qf = {qo,t,q/,t} and 



qo,t ^ {x,y} 

,q/,t^{y} 



lit: 



X I— ^ m 
y I-)- ± 



qo,t ^ {(q/,m,x,y i-)-y)} 

q/.t^0 



By Def 15, o = {Q,, qo,t , ilo , {q/,n} , ^''o) where Q, = {qo,t , q/,t , qo,n , q/,n }, 



rio: 



y n 



and 



'qo.t^iiq/.t^x.y-^y)} 
q/,t'-*-{(qo,n>e.y'-^y)} 

qo,n^{(q/,«,y,-L)} 

lq/,«^0 



We now build i^j(„.„,n)D = [«](-^JmD °-^J«d); let q.v and q, be two new states with jq.,] = 
{x} and |q,| = 0, as prescribed by Def 17, we have ^{„.m„)^ = (Q[«i,q^,ri[„],{q,},/r[„j) 
where Q^j = (^U{q„qt} and the initial setting ri[„] by restricting rjo on |q,|,i.e. ri[„] : xi->- 
m; moreover. 



tr,. 



q* '-^ {(qo.t) «)^)}) where dom{o) = {x,y} and a(x) = y and a(y) = ★ 

q/," '-^ {(q/,»,-L)} 

,q^-^ft(q)> if q G Qt \ {q/,n} 



Further, by Def 16, i^((„.mn>)*D is obtained by adding two extra transitions qj i-)- 
{(q»,e,-L)} and q, i-)> {(qj,r>,ri[„])}. 

Finally, by Def 15, we obtain the HDS !^C([m({n.mn)Y\i follows. First, let i?^'^u = 
({qo,m> q/,m}> qo,m>ilm, {q/,,„}, be obtained as in (1 1) by defining IqJ) „| = {x'}, \a!fj = 
0,11m: x'H^OT,fr;„: q} „ 0, and /r;„ : q[,,„ {(q},„,x',±)}. Then, we seti?/J„((„.^)).j = 
° ^J(«.m«>J = (Q,,qo,m>il, {qi},f ) where Q,= Q(„] U {qo,m:q/,m} and 



X I— )• OT 
X I— )• OT 



'q H>fr[„](q), 
qo,m^f'"m(q/,m)U{(q/,„,e,±)} 

.q/,m ^ fm(q/.m) u{(q.,e,y ^ y)} 



if q € Q[«] 




I] : X !- - -)> m 
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We conclude with some final remarks. Equivalent definitions could have been adopted; 
for instance, T] above is not required to be injective (adding some non-determinism 
in Def 10) or some of the new states introduced by the constructions above could be 
avoided to obtain more compact HDS. We decided to use conceptually simpler con- 
structions instead of more effective, but more complex ones. 

8 Conclusion 

This paper developed the beginnings of a general theory of words with binders: nominal 
languages, nominal monoids, nominal regular expressions, HD-automata with stacks. 
We sketch some further work. 

Coming back to Table 1 further classes maybe relevant, for example words satisfy- 
ing Ax4-5 but not Ax 1-3; it will also be of interest to mix different binders each obeying 
its own axioms plus further axioms of their interaction. 

HD-automata with stacks are more powerful than necessary if one is only interested 
in recognising regular languages; a restricted class of HD-automata characterising reg- 
ular languages of m-words can be described; the same should be done for g-words, 
1- words, and s-words. 

We will also investigate the connections (cf. Example 2) of our nominal languages 
with languages (on infinite alphabets) without binders [1,17, 7, 2J. 

Further, closure properties and decidability results for these classes of automata 
should be studied; for verification purposes deterministic and nunimal automata will be 
of interest. 

Last but not least, case studies showing the relevance of this line of research to 
verification will have to be explored. 
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